On the Hellmann-Feynman theorem for statistical 

averages 



Francisco M. Fernandez 

INIFTA (UNLP, CCT La Plata-CONICET), Division Quimica Teorica, 
Blvd. 113 y 64 (S/N), Sucursal 4, Casilla de Correo 16, 
1900 La Plata, Argentina 



Abstract 

We discuss the Hellmann-Feynman theorem for degenerate states and its 
application to the calculation of the derivatives of statistical averages with 
respect to external parameters. 



Some time ago there was a discussion about the validity of the Hellmann- 

n nn 

Feynman theorem (HFT)[1] for degenerate states [2|-|5|]. Recently, some of 
those results 3|-|5( proved useful in deriving an expression for the derivative 
of the non-extensive free energy with respect to an external parameter 6 ] . 
The author took into account the possible occurrence of degenerate states 

n 

in the proof of his Lema l(6j. However, in the proof of his Theorem 2 
he appears to assume that the eigenvalues of H and of the observable A 
are nondegenerate 6[] . We think that this discrepancy should be analyzed 
carefully In this letter we investigate the connection between the HFT for 
quantum-mechanical expectation values and statistical averages when there 
are degenerate states. 
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The starting point of our discussion is the Schrodinger equation 

H^m = E m ^ m 



(1) 



where m is a set of quantum numbers that completely specify the stationary 
state ip m and we assume that (ip n \ ip m ) = 5 mn . If the Hamiltonian operator 
H depends on a parameter A then its eigenvalues and eigenvectors will also 
depend on it. Following Rastegin[6j] we assume that the spectrum of H is 
discrete. 

The Hellmann-Feynman theorem for nondegenerate states does not present 
any difficulty and for this reason we assume that the energy level E n is g„-fold 
degenerate: 

Htpni — E n iip n i, E n { = E n} i = 1,2,..., g n (2) 

If we differentiate this equation with respect to A and then apply the bra 
(iftnjl from the left, we obtain 

(Vwl^-Nvii) = ^§j^ s n ( 3 ) 

This equation tells us that there is a set of degenerate eigenvectors for which 
the diagonal HFT (i = j) is always valid. For simplicity we avoid a detailed 
discussion of the differentiation of eigenvectors and operators with respect to 
the external parameter; in this respect we follow earlier approaches to the 
subject js-5]. 

It is convenient to analyse two different cases separately. The simpler 
one takes place when g n does not change with A (at least for all values of 
physical interest of this external parameter). Any unitary transformation of 



2 



the degenerate states 



Xi = ^Cjiipnj, i = 1,2,..., g n (4) 
yields a set of g n eigenvectors of H with eigenvalue E n . They satisfy 

k=l 

Since g n does not change with A it is obvious that = for all A; = 
1, 2, . . . , g n and this equation simplifies to 

= ^ (6) 

that is similar to ([3]). In other words: in this case we do not have to worry 
about choosing a particular set of eigenvectors and all the results derived by 



Rastegin[6j apply to any observable provided that degeneracy is not removed 
through variations of A. 



When g n changes, for example at A = Aq, then ^J^- v ^ 



A=A 



ax 



for 

A=A 



some i j and Eq. ([6]) does not follow from Eq. (jHJ). However, in this case 
we can derive the equation [3| 

E (x^ ix*> = E -ar (7) 
i=i K=i 

that was invoked by Rastegin 6|] to prove his Lemma 1. Typically, g n (X) < 
gn{^o) which happens, for example, when the symmetry of the system is 
greater when A = Ao- 

Before discussing the trace averages that currently appear in statistical 
mechanics, it is convenient to analyse this problem from another point of 
view. If we differentiate Eq. (CQ) with respect to A and then apply the bra 



(if) n \ from the left, we obtain an expression for both the diagonal (m = n) 
and off- diagonal (m ^ n ) hftQ 

dR 



i , i 9H . . , , . - dip, 

\Wn\ \Vm) = [E m - E n ) {lf) n \ 



m \ '~' J -'m c 



(8) 



If the eigenvalues E m and E n are degenerate at A = A E m (X ) = R n (\ ) 
then 

$mn (9) 



(tpn\ ^ IV'm) 



A=A 



A=A 



This equation is identical to Eq. (j3J) but its derivation reveals that the diag- 
onal HFT applies to degenerate states provided that we choose the eigenvec- 
tors of H according to 

^„(Ao) = lim Vn(A) (10) 

A— >Ao 

In what follows we analyse the HFT in the context of statistical-averages 
that we develop in a somewhat more general setting than that considered by 
Rastegin[6j. For any Hermitian operator W that commutes with H: 



H,W 







(11) 



the hypervirial theorem 



tells us that 



If the trace 



(^| H,W \^ s ) = {E i -E i ){^ i \W\^ j ) 



{ipi\W 1^ = if Rij^Rj 



(12) 



(13) 



tr (w^Pj = EE (<p<\&M W^Wi) (i4) 



4 



exists then it is invariant under unitary transformations of the basis set and 
we can thus choose the eigenvectors of H (fTU|) that satisfy Eq. (jUJ). Since 
they also satisfy Eq. ffl3|) we have 

tr iw— = x: m w m m m = e m w w ^ as) 



This expression is the basis for many of the results derived by Rasteginpj 
such as, for example, his Lemma 1: 



tr 



f(H) 



OH 
~dX 



(16) 



We appreciate that we do not have to worry about degeneracy when calcu- 
lating traces provided that W is diagonal with respect to the nondegenerate 
eigenvectors of H. In other words: we do not need to invoke the HFT sum 
expression (J7|). 



Equation ([15]) also applies to any operator A that depends on a parameter 
A, exhibits a discrete spectrum and commutes with W. Following Rasteginjc] 
we choose an element of the complete set of commuting observables that 
shares a common eigenbasis with H 



II 



(17) 



In such a case we have 



provided that 



tr (w^=Y j mW\^ n ) 



da n 
dX 



(V»n| ^ IV'm) 



da r 



A=A 



dX 



A=A 



when E m (X ) = E n (X ) and E m (X) ^ E n (X) for A ^ A c 



(18) 



(19) 



Rastegin's equations (27) and (31) that are necessary for proving his 
theorem 2 6( require that the eigenvectors satisfy present equations ([9]) and 
f|T9|) when g n changes at Ao- If one does not state these conditions explicitly 
then one is in principle assuming that g n does not change with A and the 
resulting theorems are not so widely applicable. 
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